INDUCING VALUABLE RULES FROM IMBALANCED DATA: THE CASE OF AN IRANIAN BANK EXPORT LOANS
Authors: not recorded
Abstract:
Credit scoring is a classification problem for which numerous techniques have been introduced, such as support vector machines, neural networks and rule-based classifiers. Rule bases are the top priority in credit decision making because of their ability to explicitly distinguish between good and bad applicants. In a credit-scoring context, imbalanced data sets frequently occur, as the number of good loans in a portfolio is usually much higher than the number of loans that default. This paper explores the suitability of RIPPER, OneR, Decision Table, PART and C4.5 for extracting loan default prediction rules. A real database of export loans from an Iranian bank is used, and class imbalance is investigated by randomly oversampling the minority class of defaulters along with three samplings of the majority class of non-defaulters. The performance criteria chosen to measure this effect are the area under the receiver operating characteristic curve (AUC), accuracy and the number of rules. Friedman's statistic is used to test for significant differences between techniques and datasets. The results show that PART is the best classifier on all of the balanced and imbalanced datasets.
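As a rough illustration of two ideas named in the abstract, the sketch below shows random oversampling of a minority class and an AUC computed by pairwise comparison. It is not the authors' code: the function names `random_oversample` and `auc`, the toy labels and the scores are all assumptions made for illustration, and balancing to an exact 1:1 ratio is only one possible choice.

```python
import random
from collections import Counter


def random_oversample(rows, labels, minority_label, seed=0):
    """Duplicate randomly chosen minority rows until both classes are equal in size."""
    rng = random.Random(seed)
    minority = [i for i, y in enumerate(labels) if y == minority_label]
    majority = [i for i, y in enumerate(labels) if y != minority_label]
    # Draw with replacement from the minority class to close the gap.
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    keep = majority + minority + extra
    return [rows[i] for i in keep], [labels[i] for i in keep]


def auc(scores, labels, positive_label):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == positive_label]
    neg = [s for s, y in zip(scores, labels) if y != positive_label]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy portfolio: 8 good loans and 2 defaults.
rows = [[i] for i in range(10)]
labels = ["good"] * 8 + ["default"] * 2
balanced_rows, balanced_labels = random_oversample(rows, labels, "default")
print(Counter(balanced_labels))

# Hypothetical default scores from some classifier.
print(auc([0.9, 0.8, 0.4, 0.3], ["default", "good", "default", "good"], "default"))
```

Oversampling would normally be applied to the training split only, so that duplicated defaulters do not leak into the data used to measure AUC.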
Similar resources
Data Mining Rules and Classification Methods in Insurance: The Case of Collision Insurance
In Iran, insurance premiums are mostly assigned according to old rules authorized by the government; in such a situation, predicting premiums by analyzing the database and its characteristics would definitely be a big mistake. Therefore, the most useful information one can gather from these data is the amount of loss that occurs during a contract, for predicting insurance ...
A Study on Insurer Solvency by Panel Data Model: The Case of Iranian Insurance Market
The aim of this thesis is to develop an approach for assessing the solvency of Iranian insurance companies. We use economic data with both time-series and cross-sectional variation, and therefore examine insurer solvency using a panel data model.
The Clustering and Classification Data Mining Techniques in Insurance Fraud Detection: The Case of Iranian Car Insurance
Given the ever-increasing spread of fraud in the insurance sector, especially in automobile insurance, and its negative consequences for insurance companies, applying appropriate and efficient methods to identify and detect fraud in this area is essential. Understanding the patterns in data on previously reported claims can help determine whether a loss claim is genuine or not. One of the most common and widely used ways of discovering patterns in data is the use of ...
Journal title
Volume 2, Issue 1
Pages 130-135
Publication date: 2013-06-01